Picture for Haibo Zhang

Haibo Zhang

Head-Pose-Aware Visual Speech Recognition with FiLM Modulation

Add code
May 30, 2026
Viaarxiv icon

How Reliable Are Semantic-ID Tokenizer Comparisons in Generative Recommendation?

Add code
May 25, 2026
Viaarxiv icon

CAST: Modeling Semantic-Level Transitions for Complementary-Aware Sequential Recommendation

Add code
Apr 21, 2026
Viaarxiv icon

Orchestrating Tokens and Sequences: Dynamic Hybrid Policy Optimization for RLVR

Add code
Jan 09, 2026
Viaarxiv icon

Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE

Add code
Dec 08, 2025
Viaarxiv icon

Towards Reliable Evaluation of Large Language Models for Multilingual and Multimodal E-Commerce Applications

Add code
Oct 23, 2025
Viaarxiv icon

LLM-OREF: An Open Relation Extraction Framework Based on Large Language Models

Add code
Sep 18, 2025
Viaarxiv icon

Compass-Thinker-7B Technical Report

Add code
Aug 12, 2025
Figure 1 for Compass-Thinker-7B Technical Report
Figure 2 for Compass-Thinker-7B Technical Report
Viaarxiv icon

Optimal Transport-Based Token Weighting scheme for Enhanced Preference Optimization

Add code
May 24, 2025
Viaarxiv icon

Leveraging Generalizability of Image-to-Image Translation for Enhanced Adversarial Defense

Add code
Apr 02, 2025
Figure 1 for Leveraging Generalizability of Image-to-Image Translation for Enhanced Adversarial Defense
Figure 2 for Leveraging Generalizability of Image-to-Image Translation for Enhanced Adversarial Defense
Figure 3 for Leveraging Generalizability of Image-to-Image Translation for Enhanced Adversarial Defense
Figure 4 for Leveraging Generalizability of Image-to-Image Translation for Enhanced Adversarial Defense
Viaarxiv icon